DocuBurst: Visualizing Document Content using Language Structure

Authors

  • Christopher Collins
  • M. Sheelagh T. Carpendale
  • Gerald Penn
Abstract

Textual data is at the forefront of information management problems today. One response has been the development of visualizations of text data. These visualizations, commonly based on simple attributes such as relative word frequency, have become increasingly popular tools. We extend this direction, presenting the first visualization of document content which combines word frequency with the human-created structure in lexical databases to create a visualization that also reflects semantic content. DocuBurst is a radial, space-filling layout of hyponymy (the IS-A relation), overlaid with occurrence counts of words in a document of interest to provide visual summaries at varying levels of granularity. Interactive document analysis is supported with geometric and semantic zoom, selectable focus on individual words, and linked access to source text.
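The idea of overlaying word occurrence counts on a radial layout of an IS-A hierarchy can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and not the authors' implementation: the tiny `HYPONYMS` hierarchy is hand-made (a real system would draw on a lexical database), the helper names `aggregate` and `radial_layout` are hypothetical, and sizing each wedge by the number of leaves beneath it is simply one plausible choice.

```python
from collections import Counter

# Hypothetical toy IS-A hierarchy (parent -> hyponyms), standing in for a
# lexical database. It is not taken from the paper.
HYPONYMS = {
    "entity": ["animal", "plant"],
    "animal": ["dog", "cat"],
    "plant": ["tree"],
    "dog": [],
    "cat": [],
    "tree": [],
}


def aggregate(node, word_counts, totals, leaves):
    """Fill totals[node] (own + descendant occurrences) and leaves[node]."""
    total = word_counts.get(node, 0)
    children = HYPONYMS[node]
    leaf_count = 0 if children else 1
    for child in children:
        aggregate(child, word_counts, totals, leaves)
        total += totals[child]
        leaf_count += leaves[child]
    totals[node] = total
    leaves[node] = leaf_count


def radial_layout(node, leaves, start=0.0, extent=360.0, depth=0, out=None):
    """Map each node to (ring depth, start angle, angular extent) in degrees.

    Assumption of this sketch: a child's wedge is proportional to the number
    of leaves in its subtree, so the wedges of siblings fill the parent's arc.
    """
    if out is None:
        out = {}
    out[node] = (depth, start, extent)
    angle = start
    for child in HYPONYMS[node]:
        share = extent * leaves[child] / leaves[node]
        radial_layout(child, leaves, angle, share, depth + 1, out)
        angle += share
    return out


text = "the dog chased the cat and the dog slept under a tree"
word_counts = Counter(text.lower().split())

totals, leaves = {}, {}
aggregate("entity", word_counts, totals, leaves)
layout = radial_layout("entity", leaves)

# Overlay: scale each node's cumulative count to [0, 1], e.g. for colour intensity.
max_total = max(totals.values()) or 1
intensity = {node: totals[node] / max_total for node in totals}

for node, (depth, start, extent) in layout.items():
    print(node, depth, round(start, 1), round(extent, 1), round(intensity[node], 2))
```

Summing counts up the hierarchy means a general term such as "animal" is emphasized even when only its hyponyms occur in the text, which is one way a summary could be read at coarser or finer levels of granularity.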

Similar resources

DocuBurst: Document Content Visualization Using Language Structure

We present the first visualization of document content which takes advantage of the human-created structure in lexical databases. We use the WordNet hyponymy (IS-A) relationship as the structure for radial, space-filling trees which quickly reveal the concepts contained within a document of interest. Interactive techniques of zoom, filter, and details-on-demand support document analysis. The vi...

DocuBurst: Radial Space-Filling Visualization of Document Content

We present the first visualization of document content which takes advantage of the human-created structure in lexical databases. We use an accepted design paradigm to generate visualizations which improve the usability and utility of WordNet as the backbone for document content visualization. A radial, space-filling layout of hyponymy (IS-A relation) is presented with interactive techniques of...

An Analysis of Ministry of Education’s Strategic Plans Based on Favorable Components of English Language Teaching Using Shannon’s Entropy

The present research aims to analyze the content of Ministry of Education’s strategic plans (the Fundamental Reform Document of Education, the Comprehensive National Scientific Plan and the National Curriculum Document) based on Shannon's entropy regarding the favorable components of teaching English. The contents of the Fundamental Reform Document of Education, the Comprehensive National Scien...

Writers on the Move: Visualizing Composing Processes Involved in Academic Writing

The present research study aimed to explore the covert processes of editing and revision involved in writing four different academic text genres (i.e., abstract, conclusion, data commentary, and cover letter) in English. To this end, six EFL learners with Persian as their mother tongue were recruited to participate in this study. All the participants attended an induction session and ea...

A Review of Structure, Content, and Terminology Standards for Electronic Health Records in Selected Organizations and a Proposed Model for Iran

Introduction: An electronic health record (EHR) is defined as digitally stored healthcare information covering an individual's lifetime, with the purpose of supporting continuity of care, education, and research. A major issue that must be addressed to enable sharing and exchange is the development and use of content and structure standards in the EHR. Based on, this investigation...

Journal:
  • Comput. Graph. Forum

Volume 28

Publication date: 2009